A Vocabulary-independent Keyword Spotter for Spontaneous Chinese Speech
نویسنده
چکیده
HarkMan keyword-spotter was designed so that it can be used in a real-world environment to automatically spot the given words of a vocabulary-independent (VIND) task in unconstrained Chinese telephone speech. In this spotter, the speaking manner and the number of keywords are not limited. This paper focuses on a novel technique that addresses acoustic modeling, keyword-spotting network, search strategies, robustness, and rejection adopted in HarkMan. The underlying technologies used in HarkMan given in this paper are not only for keyword spotting but also for continuous speech recognition, which had been proved very efficient. It achieved the figure-of-merit (FOM) value over 90%.
منابع مشابه
A Study on Out-of-vocabulary Word Modeling for a Segment-based Keyword Spotting System
The purpose of a word spotting system is to detect a certain set of keywords in continuous speech. The most common approach consists of models of the keywords augmented with \ ller," or \garbage" models, that are trained to account for non-keyword speech and background noise. Another approach is to use a large vocabulary continuous speech recognition system (LVCSR) to produce the most likely hy...
متن کاملAn Effective Approach for Chinese Speech Recognition on Small size of Vocabulary
In this paper, an effective approach for Chinese speech recognition on small vocabulary size is proposed the independent speech recognition of Chinese words based on Hidden Markov Model (HMM). The features of speech words are generated by sub-syllable of Chinese characters. Total 640 speech samples are recorded by 4 native males and 4 females with frequently speaking ability. The preliminary re...
متن کاملKeyword spotting enhancement for video soundtrack indexing
Multimedia databases contain an increasing amount of videos that are hardly semantically accessed. Among the useful indices that can be extracted from the sound track, the presence of a keyword at some place plays a prominent role. This paper deals with the specificities of such a keyword spotter and the enhancement brought to our previous technique, [1] based on frame labeling. To be useful, s...
متن کاملKeyword Spotting Enhancement for Video Sountrack Indexing
Multimedia databases contain an increasing amount of videos that are hardly semantically accessed. Among the useful indices that can be extracted from the sound track, the presence of a keyword at some place plays a prominent role. This paper deals with the specificities of such a keyword spotter and the enhancement brought to our previous technique, [1] based on frame labeling. To be useful, s...
متن کاملImproving Task Independent Utterance Verification Based on On-line Garbage Phoneme Likelihood
Utterance verification based on on-line garbage (OLG) models is often adopted as the benchmark method. However, we find its performance can be remarkably improved by fine-tuning. In this study, OLG phoneme likelihood is proposed. It achieves much better performance and efficiency for task independent utterance verification to reject mis-recognition and OOV utterances than the OLG frame likeliho...
متن کامل